# AWQ Quantization
Deepseek R1 0528 Qwen3 8B AWQ 4bit
MIT
The AWQ quantized version of DeepSeek-R1-0528-Qwen3-8B, suitable for efficient inference in specific scenarios.
Large Language Model
Transformers

D
hxac
179
2
Meta Llama 3.3 70B Instruct AWQ INT4
Llama 3.3 70B Instruct AWQ INT4 is the 4-bit quantized version of the Meta Llama 3.3 70B Instruct model, optimized for multilingual dialogue use cases and text generation tasks.
Large Language Model
Transformers Supports Multiple Languages

M
ibnzterrell
6,410
22
Biomistral 7B AWQ QGS128 W4 GEMM
Apache-2.0
BioMistral is an open-source medical domain model suite based on the Mistral architecture, further pretrained using PubMed Central open-access text data.
Large Language Model
Transformers Supports Multiple Languages

B
BioMistral
224
5
Featured Recommended AI Models